Abstract

Let $X_{i,j}$, $i,j = 1,\dots,n$, be independent, not necessarily identically distributed random variables with finite first moments. We give estimates for the expectation of the norm of the random matrix $(X_{i,j})_{i,j=1}^n$. We improve a result by R. Latala.
1 Introduction and Notation




We study the order of magnitude of the expectation of the largest singular value, i.e. the norm of random matrices with independent entries

\[ \mathbb{E}\left\|(a_{i,j}g_{i,j})_{i,j=1}^n\right\|_{2\to 2}, \]

where $a_{i,j}\in\mathbb{R}$, $i,j=1,\dots,n$, $g_{i,j}$, $i,j=1,\dots,n$, are standard Gaussian random variables and $\|\cdot\|_{2\to 2}$ is the operator norm on $\ell_2^n$. There are two cases with a complete answer. Chevet [2] showed for matrices satisfying $a_{i,j}=a_ib_j$ that the expectation is proportional to

\[ \|a\|_2\|b\|_\infty+\|a\|_\infty\|b\|_2, \]

where $\|a\|_2$ denotes the Euclidean norm of $a=(a_1,\dots,a_n)$ and $\|a\|_\infty=\max_{1\le i\le n}|a_i|$.
For diagonal matrices with diagonal elements $d_1,\dots,d_n$ we have that the expectation of the norm is of the order of the Orlicz norm $\|(d_1,\dots,d_n)\|_M$ where the Orlicz function is given by $M(s)=\sqrt{\frac{2}{\pi}}\int_0^s e^{-\frac{1}{2t^2}}\,dt$ [5]. This Orlicz norm is up to a logarithm of $n$ equal to the norm $\max_{1\le i\le n}|d_i|$.

These two cases are of very different structure and seem to present essen- 
tially what might occur concerning the structure of matrices. This leads us to 
conjecture that the expectation for arbitrary matrices is up to a logarithmic 
factor equal to 



\[ \max_{i=1,\dots,n}\left\|(a_{i,j})_{j=1}^n\right\|_2+\max_{j=1,\dots,n}\left\|(a_{i,j})_{i=1}^n\right\|_2. \qquad (1) \]
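The comparison behind this conjecture can be explored numerically. The following is a minimal sketch and not part of the paper: it assumes a small square matrix given as a list of rows, approximates the operator norm by plain power iteration, and estimates the expectation by Monte Carlo. The helper names `operator_norm`, `row_col_bound` and `mean_gaussian_norm` are hypothetical.

```python
import math
import random


def operator_norm(a, iters=100):
    """Approximate the largest singular value of a square matrix (list of rows)
    by power iteration on A^T A. Illustrative only, not tuned for accuracy."""
    n = len(a)
    rng = random.Random(0)  # fixed random start vector, to avoid unlucky symmetric starts
    v = [rng.gauss(0.0, 1.0) for _ in range(n)]
    for _ in range(iters):
        w = [sum(a[i][j] * v[j] for j in range(n)) for i in range(n)]  # w = A v
        u = [sum(a[i][j] * w[i] for i in range(n)) for j in range(n)]  # u = A^T w
        norm_u = math.sqrt(sum(t * t for t in u))
        if norm_u == 0.0:
            return 0.0
        v = [t / norm_u for t in u]
    w = [sum(a[i][j] * v[j] for j in range(n)) for i in range(n)]
    return math.sqrt(sum(t * t for t in w))


def row_col_bound(a):
    """max_i ||row_i||_2 + max_j ||column_j||_2, the conjectured order of magnitude."""
    n = len(a)
    max_row = max(math.sqrt(sum(v * v for v in row)) for row in a)
    max_col = max(math.sqrt(sum(a[i][j] ** 2 for i in range(n))) for j in range(n))
    return max_row + max_col


def mean_gaussian_norm(a, trials=200, seed=0):
    """Monte Carlo estimate of E ||(a_ij g_ij)||_{2->2} for independent standard Gaussians g_ij."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        g = [[v * rng.gauss(0.0, 1.0) for v in row] for row in a]
        total += operator_norm(g)
    return total / trials
```

Comparing `mean_gaussian_norm(a)` with `row_col_bound(a)` for a few test matrices gives a rough feeling for the size of the logarithmic factor; such an experiment illustrates the conjecture but proves nothing.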



Latala [Ij showed for arbitrary matrices 



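\[ \mathbb{E}\left\|(a_{i,j}g_{i,j})_{i,j=1}^n\right\|_{2\to 2}\lesssim\max_{i=1,\dots,n}\left\|(a_{i,j})_{j=1}^n\right\|_2+\max_{j=1,\dots,n}\left\|(a_{i,j})_{i=1}^n\right\|_2+\left\|(a_{i,j})_{i,j=1}^n\right\|_4. \]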



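Seginer [11] showed for any $n\times m$ random matrix $(X_{i,j})_{i,j=1}^{n,m}$ of independent identically distributed random variables

\[ \mathbb{E}\left\|(X_{i,j})_{i,j=1}^{n,m}\right\|_{2\to 2}\le c\left(\mathbb{E}\max_{1\le i\le n}\left\|(X_{i,j})_{j=1}^m\right\|_2+\mathbb{E}\max_{1\le j\le m}\left\|(X_{i,j})_{i=1}^n\right\|_2\right). \]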



The largest singular value was first investigated in [12], [13]. The behavior of
the smallest singular value has been determined in [H [51 [Zj- 

Theorem 1.1. There is a constant $c>0$ such that for all $a_{i,j}\in\mathbb{R}$, $i,j=1,\dots,n$, and all independent standard Gaussian random variables $g_{i,j}$, $i,j=1,\dots,n$,




E I max 

i=l,...,n 



y(^i,j9i,j)j=l 



+ E max ||(aijc/ij)^=i||2 

2/ \]=l,...,n 



In the same way as we prove Theorem 1.1 we can show the similar formula

[0'i,j9i,j)i,j=l 




2^2 



,0'i,j)ij=l 



max 

i=l,...,n 



,a,; 



,jjj=l 



+ max ||(a 

2 j=l,...,n 



hjJi=l\\2 



This inequality is generalized to arbitrary random variables as in [3].



Theorem 1.2. Let $X_{i,j}$, $i,j=1,\dots,n$, be independent, mean zero random variables. Then



i?(||(X„-)i:,=J J <c(lnH)ME max ||(X„-)^=J + E max ||(X,,)r=i 

\ '-' 2-^2/ \ 1=1,. ...n ■' 2 1=1,. ...n 



Since $\mathbb{E}\left\|(a_{i,j}g_{i,j})_{i,j=1}^n\right\|_{2\to 2}$ is up to a logarithmic factor equal to (1), we look for a better estimate from below. On the other hand,

\[ \mathbb{E}\left(\max_{i=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{j=1}^n\right\|_2\right)+\mathbb{E}\left(\max_{j=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{i=1}^n\right\|_2\right) \qquad (2) \]

is obviously at most $2\,\mathbb{E}\left\|(a_{i,j}g_{i,j})_{i,j=1}^n\right\|_{2\to 2}$, since the Euclidean norm of every row and of every column of a matrix is at most its operator norm. We show that the expression (2) is equivalent to the Musielak-Orlicz norm of the vector $(1,\dots,1)$, where the Orlicz functions are given through the coefficients $a_{i,j}$, $i,j=1,\dots,n$. Our formula (Theorem 3.1) enables us to estimate from below the expectation of the operator norm in many cases efficiently.

Moreover, we do not know of any matrix where the expectation of the norm is not of the same order as (2).

A convex function $M:[0,\infty)\to[0,\infty)$ with $M(0)=0$ is called an Orlicz function. Let $M$ be an Orlicz function and $x\in\mathbb{R}^n$; then the Orlicz norm of $x$, $\|x\|_M$, is defined by

\[ \|x\|_M=\inf\left\{\rho>0\ \middle|\ \sum_{i=1}^n M\left(\frac{|x_i|}{\rho}\right)\le 1\right\}. \]



We say that two Orlicz functions $M$ and $N$ are equivalent ($M\sim N$) if there are strictly positive constants $c_1$ and $c_2$ such that for all $s\ge 0$

\[ M(c_1s)\le N(s)\le M(c_2s). \]

If two Orlicz functions are equivalent, so are their norms: For all $x\in\mathbb{R}^n$

\[ c_1\|x\|_M\le\|x\|_N\le c_2\|x\|_M. \]
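For completeness, here is a short sketch (not taken from the paper) of why the pointwise inequalities $M(c_1s)\le N(s)\le M(c_2s)$ transfer to the norms. It only uses the definition of the Orlicz norm as an infimum over admissible $\rho>0$.

```latex
% Left inequality: take any \rho > 0 with \sum_i N(|x_i|/\rho) \le 1. Then
\sum_{i=1}^n M\!\left(\frac{|x_i|}{\rho/c_1}\right)
  = \sum_{i=1}^n M\!\left(c_1\frac{|x_i|}{\rho}\right)
  \le \sum_{i=1}^n N\!\left(\frac{|x_i|}{\rho}\right) \le 1,
% so \|x\|_M \le \rho/c_1. Taking the infimum over all such \rho:
\qquad c_1\|x\|_M \le \|x\|_N .

% Right inequality: take any \rho > 0 with \sum_i M(|x_i|/\rho) \le 1. Then
\sum_{i=1}^n N\!\left(\frac{|x_i|}{c_2\rho}\right)
  \le \sum_{i=1}^n M\!\left(c_2\frac{|x_i|}{c_2\rho}\right)
  = \sum_{i=1}^n M\!\left(\frac{|x_i|}{\rho}\right) \le 1,
% so \|x\|_N \le c_2\rho. Taking the infimum over all such \rho:
\qquad \|x\|_N \le c_2\|x\|_M .
```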

In addition, let $M_i$, $i=1,\dots,n$, be Orlicz functions and let $x\in\mathbb{R}^n$; then the Musielak-Orlicz norm of $x$, $\|x\|_{(M_i)_i}$, is defined by

\[ \|x\|_{(M_i)_i}=\inf\left\{\rho>0\ \middle|\ \sum_{i=1}^n M_i\left(\frac{|x_i|}{\rho}\right)\le 1\right\}. \]
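Both norms are infima over a one-parameter family, so they can be approximated by bisection. The sketch below is an illustration under stated assumptions, not an implementation from the paper: each Orlicz function is passed as a Python callable that is convex, vanishes at 0 and is not identically zero, and the function names are hypothetical.

```python
def musielak_orlicz_norm(x, funcs, rel_tol=1e-12):
    """Approximate inf{rho > 0 : sum_i M_i(|x_i| / rho) <= 1} by bisection.

    `funcs` is a list of callables M_i, assumed convex with M_i(0) = 0 and
    not identically zero (an assumption of this sketch)."""
    if all(v == 0 for v in x):
        return 0.0

    def total(rho):
        return sum(m(abs(v) / rho) for m, v in zip(funcs, x))

    hi = max(abs(v) for v in x)
    while total(hi) > 1.0:   # enlarge rho until the constraint holds
        hi *= 2.0
    lo = hi
    while total(lo) <= 1.0:  # shrink rho until the constraint fails
        lo /= 2.0
    while hi - lo > rel_tol * hi:
        mid = 0.5 * (lo + hi)
        if total(mid) <= 1.0:
            hi = mid
        else:
            lo = mid
    return hi


def orlicz_norm(x, m):
    """Orlicz norm as the special case where all M_i coincide."""
    return musielak_orlicz_norm(x, [m] * len(x))
```

With $M(t)=t^2$ the routine reproduces the Euclidean norm and with $M(t)=t$ the $\ell_1$-norm, which is a convenient sanity check.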



2 The upper estimate 

In this section we are going to prove the upper estimate. We require the following known lemma. For a more general form see e.g. ([10], Lemma 10).

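Lemma 2.1. Let $x^{(l)}=\frac{1}{\sqrt{l}}(\underbrace{1,\dots,1}_{l},\underbrace{0,\dots,0}_{n-l})$, $l=1,\dots,n$, and let $B_T$ be the convex hull of the points $(\varepsilon_1x^{(l)}_{\pi(1)},\dots,\varepsilon_nx^{(l)}_{\pi(n)})$, where $\varepsilon_i=\pm 1$, $i=1,\dots,n$, and $\pi$ denote permutations of $\{1,\dots,n\}$. Let $\|\cdot\|_T$ be the norm on $\mathbb{R}^n$ whose unit ball is $B_T$. Then, for all $x\in\mathbb{R}^n$

\[ \|x\|_2\le\|x\|_T\le\sqrt{\ln(en)}\,\|x\|_2. \]

Proof. Let $x\in\mathbb{R}^n$. Then $x_1^*,\dots,x_n^*$ denotes the decreasing rearrangement of the numbers $|x_1|,\dots,|x_n|$. Let $a_k=\sqrt{k}-\sqrt{k-1}$ for $k=1,\dots,n$. Then, for all $x\in\mathbb{R}^n$

\[ \|x\|_T=\sum_{k=1}^n x_k^*\left(\sqrt{k}-\sqrt{k-1}\right). \]

Since $\sqrt{k}-\sqrt{k-1}\le\frac{1}{\sqrt{k}}$,

\[ \|x\|_T\le\left(\sum_{k=1}^n\left|\sqrt{k}-\sqrt{k-1}\right|^2\right)^{\frac12}\|x\|_2\le\left(\sum_{k=1}^n\frac{1}{k}\right)^{\frac12}\|x\|_2\le\sqrt{\ln(en)}\,\|x\|_2. \]

□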
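The two-sided estimate of Lemma 2.1 is easy to test numerically from the explicit formula for $\|x\|_T$ used in the proof. The following sketch is only a sanity check on random vectors, not a proof; the function names are hypothetical.

```python
import math
import random


def t_norm(x):
    """||x||_T = sum_k x_k^* (sqrt(k) - sqrt(k-1)), where x^* is the decreasing
    rearrangement of |x_1|, ..., |x_n| (the formula from the proof of Lemma 2.1)."""
    xs = sorted((abs(v) for v in x), reverse=True)
    return sum(v * (math.sqrt(k) - math.sqrt(k - 1)) for k, v in enumerate(xs, start=1))


def euclidean_norm(x):
    return math.sqrt(sum(v * v for v in x))


def check_lemma(n, trials=1000, seed=0):
    """Return True if ||x||_2 <= ||x||_T <= sqrt(ln(e n)) ||x||_2 held for every
    sampled Gaussian vector x in R^n (up to a tiny floating point slack)."""
    rng = random.Random(seed)
    factor = math.sqrt(math.log(math.e * n))
    slack = 1.0 + 1e-12
    for _ in range(trials):
        x = [rng.gauss(0.0, 1.0) for _ in range(n)]
        e, t = euclidean_norm(x), t_norm(x)
        if not (e <= t * slack and t <= factor * e * slack):
            return False
    return True
```

For the generating vectors $x^{(l)}$ the sum telescopes, so `t_norm` returns 1 up to rounding, in line with $x^{(l)}$ lying on the boundary of $B_T$.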
We denote 



D-p — s X — \X\i ■■■■, Xn) G O 



3i= 1, 



1, ...,n|a;,- = ±—^ 



1 



Then by our previous lemma we have

\[ \|A\|_{2\to 2}=\sup_{\|x\|_2=1}\|Ax\|_2\le\sqrt{\ln(en)}\,\sup_{x\in B_T}\|Ax\|_2. \qquad (3) \]
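The step from Lemma 2.1 to (3) can be spelled out as follows. This is a sketch added for the reader, using only the inequality $\|x\|_T\le\sqrt{\ln(en)}\,\|x\|_2$ and the fact that $B_T$ is the unit ball of $\|\cdot\|_T$.

```latex
% Let \|x\|_2 = 1 and put y = x / \sqrt{\ln(en)}. By Lemma 2.1,
\|y\|_T = \frac{\|x\|_T}{\sqrt{\ln(en)}} \le \|x\|_2 = 1,
\qquad\text{i.e.}\qquad y \in B_T .
% Hence, by homogeneity of the norm,
\|Ax\|_2 = \sqrt{\ln(en)}\,\|Ay\|_2 \le \sqrt{\ln(en)}\,\sup_{z\in B_T}\|Az\|_2 .
% Since z \mapsto \|Az\|_2 is convex, the supremum over B_T is attained at one of
% the generating points (\varepsilon_1 x^{(l)}_{\pi(1)},\dots,\varepsilon_n x^{(l)}_{\pi(n)}).
```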

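We now use the concentration of sums of independent Gaussian random variables $X=\sum_{i=1}^n g_iz_i$ in a Banach space ([5], Theorem 4.7): For all $t>0$

\[ \mathbb{P}\left\{\big|\|X\|-\mathbb{E}\|X\|\big|>t\right\}\le 2\exp\left(-Kt^2/\sigma(X)^2\right), \qquad (4) \]

where $K=\frac{2}{\pi^2}$ and

\[ \sigma(X)=\sup_{\|\xi\|=1}\left(\sum_{i=1}^n|\xi(z_i)|^2\right)^{\frac12}. \qquad (5) \]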
The following lemma is an immediate consequence. 



Lemma 2.2. For all $i,j=1,\dots,n$ let $a_{i,j}\in\mathbb{R}$, let $g_{i,j}$ be independent standard Gaussians, $G=(a_{i,j}g_{i,j})_{i,j=1}^n$ and let $x\in B_2^n$. For all $\beta\ge 1$ and all $x$ with $\max_{i=1,\dots,n}\left(\sum_{j=1}^n a_{i,j}^2x_j^2\right)^{\frac12}>0$ we have



P ||G'2;||2 > /3 E max (aijgij)'!^^ J+^[ max IKaij^ijO^tilla 

\ \ \i^l,...,n •* IJ \j=l,...,n 

/3 (^E (^.max^ ||(aij^i,j)^=iy + E (^^max^ Il(ai,i^ij)r=iii2 



< 2exp 



i\^ 



-K- 



2_, (^ij^j 



v 



max X; 0.? x^ 

i=l,...,n \ ,=1 '■' 



where $K$ is the constant from (4).



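Please note that

\[ \mathbb{E}\|Gx\|_2\le\left(\sum_{i,j=1}^n a_{i,j}^2x_j^2\right)^{\frac12},\qquad \sigma(Gx)=\max_{i=1,\dots,n}\left(\sum_{j=1}^n a_{i,j}^2x_j^2\right)^{\frac12}. \]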

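Proposition 2.3. For all $i,j=1,\dots,n$ let $a_{i,j}\in\mathbb{R}$, let $g_{i,j}$ be independent standard Gaussian random variables and let $G=(a_{i,j}g_{i,j})_{i,j=1}^n$. For all $\beta$ with $\beta\ge\sqrt{\frac{\pi}{2}}$

\[ \mathbb{P}\left(\|G\|_{2\to 2}>\beta\sqrt{\ln(en)}\,\mathbb{E}\left(\max_{i=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{j=1}^n\right\|_2+\max_{j=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{i=1}^n\right\|_2\right)\right)\le 2\sum_{l=1}^n\exp\left(l\ln(2n)-Kl\,\frac{2\beta^2}{\pi}\right), \]

where $K$ is an absolute constant. Furthermore, we get for $\beta$ such that $K\frac{2\beta^2}{\pi}=3\ln(2n)$ and $K=\frac{2}{\pi^2}$

\[ \mathbb{P}\left(\|G\|_{2\to 2}>\sqrt{\frac{3\pi^3}{4}}\,\ln(en)\,\mathbb{E}\left(\max_{i=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{j=1}^n\right\|_2+\max_{j=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{i=1}^n\right\|_2\right)\right)\le\frac{1}{n^2}. \]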



Proof. We shall apply Lemma [221 We may assume that max ( J2 (^fj^"] ) > 
0. By© 



1 1 ' — ' 1,1 1 

j=l,...,n \ j—\ '■' -" 



\[ \|G\|_{2\to 2}\le\sqrt{\ln(en)}\,\sup_{x\in B_T}\|Gx\|_2. \]

Therefore, for $\beta\in\mathbb{R}_{>0}$, we have



IP (11^112^2 > /3\/ln(en) (E ( max (aijgij)1^^ + max \\iaijgij)1^^\\ 
< P sup 11(^x112 > /3 (e ( max (aijgij)'^^^ + max ||(ai,j5'ij)r=ill2 



For all $l=1,\dots,n$ let $M_l$ be the set of $x^{(l)}\in S^{n-1}$ such that $x^{(l)}_j\in\left\{0,\pm\frac{1}{\sqrt{l}}\right\}$ for all $j=1,\dots,n$. Now we apply Lemma 2.2 and get



P 11^112^2 > I3\/Hen) E max (a^jg^j)]^^ + max ||(aij^,j)^=i 

\ \ \i=l,...,n -^ 2 j=l,...,n 



<P sup IIGxL > /3 E max 



^xes" 



i=l,...,n 



(^ij9ij)]=i + max ||(aij^ij)^=i||2 

•* z 7=1,. ...n 



<E E P^GxW ^>/3(^E(^^max^ (a,,^,,)^=i ^+^maxj|(a„a^^ 

I ( 



'=1 xW&Mi 



-K-l 



/?■ 



E I max 

i=l,...,n 



aijgij)]=i + max ||(aij^ij)^=i||2 

■^ z 1=1,. ...ra 



E a?. 



max 



n'\ 



7! I ? ^ «l- 



max I Y> «i, 



By 



-77 E E 4^ "^^,^ IIK-)r=ill2< max ||(a,,)?=ill2 



-j6{fc|x^'V0} 



and

\[ \mathbb{E}\max_{j=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{i=1}^n\right\|_2\ge\sqrt{\frac{2}{\pi}}\,\max_{j=1,\dots,n}\left\|(a_{i,j})_{i=1}^n\right\|_2 \qquad (6) \]



we have for all $\beta$ with $\beta\ge\sqrt{\frac{\pi}{2}}$



/3E jamxj\{aijgij)i\\^ 



Vi 



t E 41 >o- 



Thus 



P l|G'll2^2 > /3vln(en) E max (aijg.j)'^^^ + max IKflij^jjOlLilla 

\ \ \i=l,...,n -^ I 7 = 1,. ...n 



<2E E exp 

z=i xWeMj 



/ 


/ 


-ir 


/ 


v 


\ 



/3 



E I max 

i=l,...,n 



y^i,j9ij) j=\ 



max I X] «^ 



*='••••'" Vj-6{fci4'Vo} 



«J 



Again, by (jH]) we have for all /3 with /3 > , /| 



P l|G'll2^2 > /3vln(en) E max (aij^»j)"=i ^ + max \{ai,jg,^j)1^^\ 

\ \ \2=1,.--)''T' " ''I _7=l,...,n 

<2X: E expf-ir/^"] <2X:2Wexp('-ir/^'j 



=1 xWeM, 



1=1 



2^exp mn(2n) - i^/ 



«=i 



2/32 



TT 



We choose (3 such that 31n(2n) = K^. Then 
P ('||G'||2^2 > \/3 ln(2n) ln(n) ('e f max ||(a,j^ij)j=i 
< 2 ^ exp (/ ln(2n) - 3/ ln(2n)) = 2 ^ exp {-21 ln(2n)) = 2 J] ( ^ j 



+ max ||(ay^ij)^=i||2 



«=i 



«=i 



«=i 



^ V4nV _ , 



-'- 4n2 



4?7,2 — 1 n^ 



□
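The arithmetic in the last step of this proof can be written out as follows. This is a sketch added for the reader; it starts from the exponent $l\ln(2n)-3l\ln(2n)$ appearing in the final display and uses the counting bound $2^l\binom{n}{l}\le(2n)^l$ for the number of sign patterns on supports of size $l$.

```latex
% Counting the points with l nonzero coordinates equal to \pm 1/\sqrt{l}:
|M_l| = 2^l\binom{n}{l} \le (2n)^l = \exp\bigl(l\ln(2n)\bigr).
% With the exponent 3 l \ln(2n) coming from the choice of \beta:
2\sum_{l=1}^{n}\exp\bigl(l\ln(2n)-3l\ln(2n)\bigr)
  = 2\sum_{l=1}^{n}\left(\frac{1}{4n^2}\right)^{l}
  \le 2\,\frac{\frac{1}{4n^2}}{1-\frac{1}{4n^2}}
  = \frac{2}{4n^2-1}
  \le \frac{1}{n^2},
% where the last inequality is equivalent to 2n^2 \ge 1.
```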



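Proposition 2.4. Let $a_{i,j}\in\mathbb{R}$, $i,j=1,\dots,n$, and $g_{i,j}$, $i,j=1,\dots,n$, be independent standard Gaussian random variables, then

\[ \mathbb{E}\left\|(a_{i,j}g_{i,j})_{i,j=1}^n\right\|_{2\to 2}\le\left(1+\sqrt{\frac{3\pi^3}{4}}\,\ln(en)\right)\mathbb{E}\left(\max_{i=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{j=1}^n\right\|_2+\max_{j=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{i=1}^n\right\|_2\right). \]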



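Proof. We divide the estimate of $\mathbb{E}\left\|(a_{i,j}g_{i,j})_{i,j=1}^n\right\|_{2\to 2}$ into two parts. Let $M$ be the set of all points with

\[ \left\|(a_{i,j}g_{i,j})_{i,j=1}^n\right\|_{2\to 2}\le\sqrt{\frac{3\pi^3}{4}}\,\ln(en)\,\mathbb{E}\left(\max_{i=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{j=1}^n\right\|_2+\max_{j=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{i=1}^n\right\|_2\right). \]

Clearly,

\[ \mathbb{E}\left(\left\|(a_{i,j}g_{i,j})_{i,j=1}^n\right\|_{2\to 2}\chi_M\right)\le\sqrt{\frac{3\pi^3}{4}}\,\ln(en)\,\mathbb{E}\left(\max_{i=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{j=1}^n\right\|_2+\max_{j=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{i=1}^n\right\|_2\right). \]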



Furthermore, by the Cauchy-Schwarz inequality and Proposition 2.3 we get



^{\\KJ9^,)l,=l\l^^XM^) <v/p(m^)(e (II K, (?,,,) •,=1 



2 \ \ 2 

2^2, 



< - (E 



[(^i,j9i,j)i,j=l 



2-^2 



-"1/ Sis '"•*''] 



1 



n 



E 



"iji 



Besides, we obviously have 



< max y^ a j 1 
- l<i<n I ^ ' '^' 



max 

i=l,...,n 



■^u/i=i 



< E I max 

I j=l,...,n 



+ max \\{aij)i^^\\^ 

j=l,...,n 

{aij9ijTj=i + max ||(aij^ij)^=i||2) . 
Altogether, this yields 

E (||(aij^i,j)u=i||2^2 ^*^') - ^ Ci^^\ ||(ai,j^i,i)i=i||2 + ^max^ ||(ai,j^ij)r=ill2 
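Summing up, we get

\[ \mathbb{E}\left\|(a_{i,j}g_{i,j})_{i,j=1}^n\right\|_{2\to 2}\le\left(1+\sqrt{\frac{3\pi^3}{4}}\,\ln(en)\right)\mathbb{E}\left(\max_{i=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{j=1}^n\right\|_2+\max_{j=1,\dots,n}\left\|(a_{i,j}g_{i,j})_{i=1}^n\right\|_2\right). \]

□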



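Proof. (Theorem 1.1) W.l.o.g. we assume $a_{i,j}\le 1$, $i,j=1,\dots,n$, and that there is a coordinate that equals 1. For all $i,j=1,\dots,n$ and $k\in\mathbb{N}$ we define

\[ a_{i,j}^k=\begin{cases} a_{i,j} & \text{if } \frac{1}{2^k}<a_{i,j}\le\frac{1}{2^{k-1}},\\ 0 & \text{else.}\end{cases} \]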
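The dyadic splitting of the coefficients can be illustrated on a small example. The sketch below is not from the paper: it assumes entries in $[0,1]$ and the convention that the $k$-th piece keeps exactly the entries with $2^{-k}<a_{i,j}\le 2^{-(k-1)}$; the function names are hypothetical.

```python
def dyadic_pieces(a, max_k):
    """Split a matrix with entries in [0, 1] into pieces A^1, ..., A^max_k, where
    A^k keeps a_ij if 2**-k < a_ij <= 2**-(k-1) and has 0 in all other positions."""
    pieces = []
    for k in range(1, max_k + 1):
        lo, hi = 2.0 ** (-k), 2.0 ** (-(k - 1))
        pieces.append([[v if lo < v <= hi else 0.0 for v in row] for row in a])
    return pieces


def count_nonzero(m):
    """Number of nonzero entries of a matrix, called phi(k) for the k-th piece."""
    return sum(1 for row in m for v in row if v != 0.0)
```

Every nonzero entry of the $k$-th piece exceeds $2^{-k}$, so the number of nonzero entries of that piece is at most $2^k$ times the sum of all entries; this is the counting argument used in the next step of the proof.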



Let $G=(a_{i,j}g_{i,j})_{i,j=1}^n$ and $G^k=(a_{i,j}^kg_{i,j})_{i,j=1}^n$. We denote by $\phi(k)$ the number of nonzero entries of the matrix $(a_{i,j}^k)_{i,j=1}^n$ and we choose $\gamma$ such that $\left\|(a_{i,j})_{i,j=1}^n\right\|_1=2^\gamma\left\|(a_{i,j})_{i,j=1}^n\right\|_\infty$. Thus, we get $\phi(k)\frac{1}{2^k}\le\sum_{i,j=1}^n a_{i,j}\le 2^\gamma$ and therefore $\phi(k)\le 2^{k+\gamma}$. Therefore, the non-zero entries of $G^k$ are contained in a submatrix of size $2^{k+\gamma}\times 2^{k+\gamma}$. Taking this into account and applying Proposition 2.4 to $G^k$



E 



G'' 



2->2 



< 1 + 



/37r3 



ln(e2'=+^) E( max 



i=l,...,n 



,(^Li9 



«,iy*J''i=i 



-|- max 

2 j=l,...,n 



K^i,j9i,j)i=l 



< im{k + 7) I E $: I4^.^,,,f I < im{k + 7)2 



2 2 



Therefore, 



y E G"= < 140 y ^v^ < 280 y 4 

fc>27 fc>27 ^"' fc=l ^* 



Since one of the coordinates of the matrix is 1 



E 



G^ 



> 



2^2 



\g\dt 



Therefore, there is a constant c such that 



y G* 

k<2y 



E\\G\\,^,<2E 
The matrix J2k<2'y G^ has at most 



+ 2y l|G"=||2^2 < cE 

2^2 '=>27 



y G'' 

k<2j 



2^2 



A;<27 A;<27 



'jJij=l 



^^i,j)i,j=l 



(7) 



entries that are different from 0. Therefore, all nonzero entries of $\sum_{k\le 2\gamma}G^k$ are contained in a square submatrix having less than (7) rows and columns.



We may apply Proposition 2.4 and get with a proper constant $c$



Ei\\G\\,^,)<c\l 



/37r3 



/ 



In 



\'^i,j)i,j=l 



E I max 

j=l n 



n 



X 






+ max 

j=l,...,n 



E 4j9i,j 






□



3 The lower estimate 

Theorem 3.1. For all $i,j=1,\dots,n$ let $a_{i,j}\in\mathbb{R}$ and $g_{i,j}$ be independent standard Gaussians. For all $s\in\mathbb{R}_{>0}$ and for all $i=1,\dots,n$ let



N,(s) = I 



s max a,- i e 

i=l,...,n' *'^' 



max a . . 
j^l,...,n *'■? 



max Ittj^jl 
j^l,...,n 



'^ i,J ' J— 1 



max a . 



'■' + 



.ai. 



3)j=l 






'^ \\i-^.)U\ 



' "^ — lICo' ■■)" II ' 



respectively let for all $s\in\mathbb{R}_{>0}$ and for all $j=1,\dots,n$



Njis) 



Then 



s max a,- o e 



max 
i = l, 






i=l,...,n 



«JI 



K.,)' 



max I Oij I 

1^1, ,n 

\\(n 0" II 
, ||'>"'»J''i=l|l2 



max a. 



1=1, ...,n 'J _|_ 3 



(a* 



.i/'j=ill2 



s — 



||K.)Li| 



S < -n r 

V *J /J — 1 



■'- IIK.L1I; 



(a*,i^i,j)i=i||2 + ^max^ II (a,j^,j)^^i| 



< E max 

\i=l,...,n 



<C2 



where $c_1$ and $c_2$ are absolute constants.



The following example is an immediate consequence of Theorem 3.1. It covers Toeplitz matrices.






Example 3.2. Let $A$ be an $n\times n$-matrix such that for all $i=1,\dots,n$ and $k=1,\dots,n$









Then 



max \ai_j\ = max \aj^k\ 
l<j<n ' i;f^j5;^ 



E max (a,jg,j)"^i + . max I!(a»,i5'»,j)r=ill2 

\2— 1, ,n ■' ^ J — 1, ,^ 



max < I 2J kijl 



, V Inn max |aij| 



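We associate to a random variable $X$ an Orlicz function $M$ by

\[ M(s)=\int_0^s\int_{\frac{1}{t}\le|X|}|X|\,d\mathbb{P}\,dt. \qquad (8) \]

We have

\[ M(s)=\int_0^s\int_{\frac{1}{t}\le|X|}|X|\,d\mathbb{P}\,dt=\int_0^s\left(\frac{1}{t}\,\mathbb{P}\left(|X|\ge\frac{1}{t}\right)+\int_{\frac{1}{t}}^\infty\mathbb{P}(|X|>u)\,du\right)dt. \qquad (9) \]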
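The passage from (8) to (9) rests on the layer-cake formula applied to a truncated variable. The following sketch, added for the reader, assumes the reading $M(s)=\int_0^s\int_{\{|X|\ge 1/t\}}|X|\,d\mathbb{P}\,dt$ of (8) and writes $a=\frac{1}{t}>0$.

```latex
% Layer-cake formula for the nonnegative variable |X| 1_{\{|X| \ge a\}}:
\int_{\{|X|\ge a\}}|X|\,d\mathbb{P}
  = \int_0^\infty \mathbb{P}\bigl(|X|\,\mathbf{1}_{\{|X|\ge a\}}>u\bigr)\,du .
% For 0 \le u < a the event equals \{|X| \ge a\}; for u \ge a it equals \{|X| > u\}:
\int_0^\infty \mathbb{P}\bigl(|X|\,\mathbf{1}_{\{|X|\ge a\}}>u\bigr)\,du
  = a\,\mathbb{P}(|X|\ge a) + \int_a^\infty \mathbb{P}(|X|>u)\,du .
% Substituting a = 1/t and integrating over t \in (0,s) gives the right-hand side of (9).
```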



Lemma 3.3. There are strictly positive constants $c_1$ and $c_2$ such that for all $n\in\mathbb{N}$, all independent random variables $X_1,\dots,X_n$ with finite first moments and for all $x\in\mathbb{R}^n$

\[ c_1\|x\|_{(M_i)_i}\le\mathbb{E}\max_{1\le i\le n}|x_iX_i|\le c_2\|x\|_{(M_i)_i}, \]

where $M_1,\dots,M_n$ are the Orlicz functions that are associated to the random variables $X_1,\dots,X_n$ (8).



Lemma 3.3 is a generalization of the same result for identically distributed random variables [3]. It can be generalized from the $\ell_\infty$-norm to Orlicz norms. We use the fact [9] that for all $x>0$

\[ \frac{\sqrt{2\pi}}{(\pi-1)x+\sqrt{x^2+2\pi}}\,e^{-\frac{x^2}{2}}\le\sqrt{\frac{2}{\pi}}\int_x^\infty e^{-\frac{s^2}{2}}\,ds\le\sqrt{\frac{2}{\pi}}\,\frac{1}{x}\,e^{-\frac{x^2}{2}}. \qquad (10) \]
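The two-sided Gaussian tail estimate can be checked numerically. The sketch below assumes the reading of (10) in which the middle term is $\sqrt{2/\pi}\int_x^\infty e^{-s^2/2}\,ds$, the lower bound is $\frac{\sqrt{2\pi}}{(\pi-1)x+\sqrt{x^2+2\pi}}e^{-x^2/2}$ and the upper bound is $\sqrt{2/\pi}\,\frac{1}{x}e^{-x^2/2}$. It uses the identity that the middle term equals `math.erfc(x / sqrt(2))`, the probability that a standard Gaussian exceeds $x$ in absolute value; the function names are hypothetical.

```python
import math


def gaussian_tail(x):
    """sqrt(2/pi) * integral from x to infinity of exp(-s^2/2) ds,
    i.e. P(|g| > x) for a standard Gaussian g, via the complementary error function."""
    return math.erfc(x / math.sqrt(2.0))


def lower_bound(x):
    """Left-hand side of the tail estimate (as read in this sketch)."""
    denom = (math.pi - 1.0) * x + math.sqrt(x * x + 2.0 * math.pi)
    return math.sqrt(2.0 * math.pi) / denom * math.exp(-x * x / 2.0)


def upper_bound(x):
    """Right-hand side of the tail estimate, valid for x > 0."""
    return math.sqrt(2.0 / math.pi) / x * math.exp(-x * x / 2.0)


def bounds_hold(x):
    """True if lower_bound(x) <= gaussian_tail(x) <= upper_bound(x)."""
    return lower_bound(x) <= gaussian_tail(x) <= upper_bound(x)
```

A call such as `all(bounds_hold(x) for x in (0.1, 1.0, 4.0))` exercises the estimate at small, moderate and large arguments; the lower bound is fairly tight, while the upper bound degenerates as $x\to 0$.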






Proof. (Theorem 3.1) We apply Lemma 3.3 to the random variables

\[ X_i=\left(\sum_{j=1}^n a_{i,j}^2g_{i,j}^2\right)^{\frac12},\qquad i=1,\dots,n. \]

Now, it is enough to show that $M_i\sim N_i$ for all $i=1,\dots,n$. We have two cases.

We consider first s < | ( E (Z]j=i '^i,j9i,j) ^ ) • There are constants Ci, C2 > 



such that for all u with m > 2E Y. o-iiS, 



2 „2 



Vi=i 



«Ji'«J 



exp —e 



u 



max af 

j=l,...,n " 



<P \T.<j9h] >« <exp - 



u 



Co 



"2 2 

I ,-1 / ; \ inax af , 



(11) 

The right-hand side inequality follows from (jlj) . The left-hand side inequality 
follows from 

n 

E2 2 \ 2 2 

i=i 
Since | > 2E X^ '^'i,j9i,j ] ' ^^ '"^^ ripply (ITT]) . Therefore, 



^ / ^ 7 exp I ,2 2 

t It"' max af 



j=l,...,n 



«J 



■u 



exp -C2- 



C?M 



max a 



j=l,...,n 



«J 



rft. 



By (HDD 



M,,(s) < 



+■ ^^P I +2 2 

r I t^ max af 



+ t max Oj ,exp 



j=l,...,n 



«J 



j=l,...,n 



«J 



t^ max a,^ 



dt. 



jr=l,...,n 



«J 



Since i>2EfEa2^.(?2V> 



'j;j=i 



we get 



1 2 1 /t^ 

- + t max a^ . <- + J- 
t j=iv,™ '"^ t V 2 



{0'i,j)]=l 



'.,j)j=i 



2 3 

< -. 

2 - t 






Thus, 

Altogether, we get 

s 

Mi{s)< J -exp 



11 2 3 

- < — \-t max a,- „• < -. 

t ~ t j=i,-,n ''^ - t 



( 



^2 



. t^ max of 



/■3 
at = — exp 



c^u2 



max a; 

j=l,...,n 



du. 



«J 



Passing to a new constant C2 and using (ITUl) we get for all s with < s < 



CXJ 

Mi{s) < 3 /"exp 



C^M^ 



max a 

j=l,...,n 



7^ \ du < — ( max I a j ,• I ) exp 



Co 



s^ max a? 

j=l,...,n " 



(12) 
From this and the definition of Ni we get that there is a constant c such that 

for all s with < s < i f E (e"=i <i^'j) ' 

\[ M_i(s)\le N_i(cs). \]
Indeed, the inequality follows immediately from flT21) provided that — < 



i /^V" 1(7- P^ ^ Tf ^ fv" 



"^ji 



2\ 2 



i\ -1 



C2 



<.<i E(E^=ia2^.<7yM then 



by m and V?maxi<,<„a,,, < E (E"=i <,^i^j)', 



Mi(s) < 



Moreover, 



2 max a 

j=l,...,n 



«J 



C2E (e^i a^^^l 



exp 



^•r 



c^(e(E^,<,4,)V^ 



4 max a 

j=l,...,n 



«J 



< 



/27r 



C2 



^ 



<Ni 



^ 



+ 1 \\(a 



\n 11—1 
i,j)j=l\\2 



C2 VV C2 y 

Therefore, with a universal constant $c$ the inequality $M_i(s)\le N_i(cs)$ also holds for those values of $s$. The inverse inequality is treated in the same way.



i\ -1 
Now we consider s with s > 5 ( E ( X) o.'ijQij 



and denote a 






E E a^igfi ■ The following holds 



vi=i 



«j^«j 



M(.) = / Up f E aL^^J > ^ + / P f E aL^L J > « U4 rft 



2a 



/ > |:4.4, >7h/' |:44 S"N4* 



+ j\\r\[t<d/j >jUJr\{t4d,j >-U4dt 



By (12) the first summand is of the order

/ 

max I a. 



j=l,...,n 



"J J I 



E E o^.,g? 



-exp 

2 



vi=i 



«j^«j 



E E<.^^. 



IN 2\ 

2 \ \ 



u=i 



max a 

j=l,...,n 



«J 



v 



/ 



We estimate the second summand. The second summand is less than or 
equal to 



JU+^iX^yij Ut<|3EfEaJ^.4J dt < 3E i^^alglA s 



i\ -1 



Therefore, with a universal constant c we have for all s with s > ^ E J2 of jg. 



2 „2 



vi=i 



iJ^V 



M,(s)<(c-i)s EI^M-n <cnE 



nji 



1 < iVi(cs). 



Vi=i 



Vi=i 



Now, we give a lower estimate. By ([H]), for all s with s > 



M,{s)>J J E«L-^S, rfprfi>/ / E«L4 ^^^^- 



t — \ Z—/ iiJ "■ 



j=i 



2 



1 \i=i 

5 



f^ E4.9; 



j=i 



2 



1 \i=i 
By the definition of a



Vi=i 



u 



u=i 



-E Yl <d, s-2\e\y: 4d, 



V 



Vi=i 




]^it.<j9-A s-1. 



u=i 



The rest is done as in the case of the upper estimate.



□